Solving the accuracy-diversity dilemma via directed random walks 
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Random walks have been successfully used to measure user or object similarities in collaborative filtering 
(CF) recommender systems, which is of high accuracy but low diversity. A key challenge of CF system is 
that the reliably accurate results are obtained with the help of peers' recommendation, but the most useful 
individual recommendations are hard to be found among diverse niche objects. In this paper we investigate the 
direction effect of the random walk on user similarity measurements and find that the user similarity, calculated 
by directed random walks, is reverse to the initial node's degree. Since the ratio of small-degree users to large- 
degree users is very large in real data sets, the large-degree users' selections are recommended extensively by 
traditional CF algorithms. By tuning the user similarity direction from neighbors to the target user, we introduce 
a new algorithm specifically to address the challenge of diversity of CF and show how it can be used to solve the 
accuracy-diversity dilemma. Without relying on any context-specific information, we are able to obtain accurate 
and diverse recommendations, which outperforms the state-of-the-art CF methods. This work suggests that the 
random walk direction is an important factor to improve the personalized recommendation performance. 



PACS numbers: 89.20.Hh, 89.75.Hc, ()5.70.Ln 
I. INTRODUCTION 



Due to the rapidly expanding internet and social network, 
we are overloaded by the unlimited information on the World 
Wide Web [1]. For instance, one has to choose among thou- 
sands of candidate commodities to shop online, and finds the 
relevant information from billions of Web pages. Compre- 
hensive exploration for each user is infeasible ||2|]. Conse- 
quently, how to efficiently help people obtain information that 
they truly need is a challenging task nowadays A land- 
mark for information filtering is the use of search engine, by 
which users could find the relevant web pages with the help of 
properly chosen keywords flUt]. However, sometimes users' 
tastes or preferences evolve with time and can not be accu- 
rately expressed by keywords, and search engines do not take 
into account the personalization and tend to return the same 
results for people with far different needs. Being an effective 
tool to address this problem, recommender systems have be- 
come a promising way to filter out the irrelevant information 
and recommend potentially interesting items to the target user 
by analyzing their interests and habits through their historical 



"jianguo. liu@sbs.ox.ac.uk 



behaviors ll3ll6l- [l2ll . Motivated by its significance in economy 
and society, the design of an efficient recommendation algo- 
rithm becomes a joint focus of theoretical physics lfl3l UM . 
computer science JH and management science (H . 

Zhang et al. lfl3ll proposed a new information frame- 
work based on the heat conduction process, namely heat- 
conduction-based (HC) recommendation model. HC model 
supposes that the objects one user has collected have the rec- 
ommendation power to help the target user find potentially 
relevant objects. If the target user is replaced by a specific 
object, HC model is similar with the collaborative filtering 
(CF) method, in which the users rated one target object have 
the recommendation powers to identify the potentially inter- 
esting users. So far, CF method has been successfully ap- 
plied to many online applications and has becomes one of the 
most successful technologies for recommender systems |0- 
[l2l [T5I [l6ll . For example, Herlocker et al. ifnll proposed an 
algorithmic framework referring to the user similarity. Luo et 
al. 11811 introduced the concepts of local and global user sim- 
ilarity based on surprisal-based vector similarity and the con- 
cept of maximum distance in graph theory. Sarwar et al. lfl9ll 
proposed the item-based CF algorithm by comparing different 
items. Deshpande et al. 112011 proposed the item-based top-A" 
CF algorithm, in which items are ranked according to the fre- 
quency of appearing in the set of similar items and the top- A" 
ranked items are returned. Gao et al. [21] incorporated the 
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FIG. 1: (Color online) Illustration of random- walk-based user sim- 
ilarity calculation, which has been used to measure user or object 
similarity in personalized recommendation, (a) The possibility walk- 
ing from user ua to user ub is used to measure the directed user 
similarity sba = 1/8. (b) Similarity from user us to user ua is 
sab = 1/4. The degrees of user ua and ub are k UA — 4 and 
k UB = 2, and one has = 



user ranking information into the computation of item similar- 
ity to improve the performance of item-based CF algorithm. 
Recently, some physical dynamics, including random walks 
ifiol I22I, l23ll and the heat conduction process lll3ll . have found 
their applications in node similarity measurement. Liu et al. 
ifToh used the random walks to calculate the user similarity and 
found that the modified CF algorithm has remarkably higher 
accuracy. As a benchmark for comparison, we call it standard 
CF algorithm (hereinafter, CF only stands for the collabora- 
tive filtering using random-walk-based user similarity iflfJl '). 
By considering the high-order correlation of the users and ob- 
jects, Zhou et al. Jll] and Liu et al. [11] proposed the ultra ac- 
curate algorithms, in which the second-order correlations are 
used to delete the redundant information. Besides reliably ac- 
curate recommendations, it is also important for recommender 
systems to help most individuals find diverse niche objects. 
CF algorithms generate recommendations according to simi- 
lar users' suggestions, which prefers to rank the popular ob- 
jects at the top positions of recommendation lists, leading to 
high accuracy but low diversity. 

Random walks have been used to quantify trajectories in a 
symmetric ways, namely in- and out-diversity and accessibil- 
ity 1I24I - I26I1 . In this paper, We argue that the opinions of small- 
degree users should be enhanced to generate diverse recom- 
mendations, and present a new directed random-walk-based 
CF algorithm, namely NCF algorithm, to investigate the ef- 
fect of user similarity direction on recommendations. The nu- 
merical results on the data sets, Netflix and Movielens, show 
that the accuracy of NCF outperforms the state-of-the-art CF 
methods with greatly improved diversity, which suggests that 
the similarity direction is an important factor for user-based 
information filtering. 



H. BIPARTITE NETWORK AND 
HEAT-CONDUCTION-BASED MODEL 

A recommendation system consists of a set of nodes, ob- 
ject nodes and connections between two nodes correspond- 
ing to an object voted or collected by a user, which could 
be represented by a bipartite network G(U,0, E). We de- 
note the object set as O — {01, 02, . . . , o m }, the user set 
as U = {ui, U2, ■ . . , u n } and the connection set as E = 
{ei, e2, • • • , e q }. The bipartite network can then be repre- 
sented by an adjacent matrix A = {a a j} G R m ' n , where 
a a j = 1 if o a is collected by Uj, and a a j = otherwise. 

The final aim of recommender systems is to identify a given 
users' interesting objects and generate a ranking list of the 
target user's uncollected objects according to the predicted 
scores. HC model supposes the neighbor nodes of one tar- 
get node as the heat sources with temperature 1, while the 
remaining nodes are of temperature 0. According to the ther- 
mal equilibrium [13], the temperature of the remaining nodes 
are set as the predicted scores, which could be calculated by 
solving the equation W h H = f, where f is the flux vector 
lfl3tl . The standard HC model firstly constructs the propagator 
matrix W , where the element w a p denotes the conduction 
rate from object op to o a , and set the temperatures of target 
node's neighbors as 1, then the heat will diffuse between heat 
sources and other nodes. Finally, the temperatures of uncol- 
lected objects are considered as recommendation scores. 

The general framework of the item-based HC model is as 
follows: (i) construct the weighted object network (i.e. deter- 
mine the matrix W) from the known user-object relations; (ii) 
determine the initial resource vector f for each user; (iii) get 
the final resource distribution via 

f" = Wf, (1) 

(iv) recommend those uncollected objects with highest final 
scores. Note that the initial configuration f is determined by 
the user's personal information, thus for different users, the 
initial configuration is different. For a given object o a , the 
ith element of f a should be zero if a a i = 0. That is to say, 
one should not put any recommendation power (i.e. resource) 
onto an unrated user. The simplest case is to set a uniformly 
initial configuration as 

ft = a al . (2) 

Under this configuration, all the users rated object o a have the 
same recommendation power. 

The traditional HC model is implemented based on the ob- 
ject similarity. If the number of users is much smaller than 
the one of objects, we could apply the similar idea based on 
the user similarity, namely user-based HC model. From the 
definition, one can find that user-based HC model is equiva- 
lent to the CF algorithm, therefore, the user similarity anal- 
ysis of the CF algorithm could also bring deep insight into 
the HC model. Zhou et al. l23ll used the random walk pro- 
cess to calculate the node similarity of bipartite networks and 
proposed the network-based inference (NBI) recommendation 
algorithm B27I1 . In Ref. 11411 . NBI algorithm was also referred 
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FIG. 2: (Color online) User degree distributions for Movielens 
and Netflix data sets, which approximately have exponential forms 
P(k) ~ exp(-0.0054fc ± 0.003). 



TABLE I: Basic statistics of the tested data sets. 



Data Sets Users 


Objects 


Links 


Sparsity 


MovieLens 1,574 


943 


82,580 


5.56 x 10" 2 


Netflix 10,000 


6,000 


701,947 


1.17 x 10~ 2 



Ui and Uj, their similarities could be written as 

Sij = -r- Ei=i Promuj to U*; 

7 m a a < 4 > 

s 3i = I^lXi^ 1 , Fromujtowj. 
Therefore, one has 



as Probs algorithm. Liu et al. II 1 Oil embedded the random walk 
process into the CF algorithm to calculate the user similarity 
and found that the algorithmic accuracy is greatly increased. 
In Ref. IllOtl . a certain amount of resource is associated with 
each user, and the weight represents the proportion of the 
resource Uj would like to distribute to itj. This process works 
by using the random walk process on user-object bipartite 
networks, where each user distributes his/her initial resource 
equally to all the objects he/she has collected, and then each 
object sends back what it has received to all the users who 
collect it. The weight stj representing the amount of initial 
resource Uj evenly transferred to u, can be defined as 



k 



1=1 



(3) 



where k U( and k Q[ indicate the degrees of user u,; and object 
oi. 

In the random walk process, the user similarity from user 
Uj to Ui, Sij, is determined by the degrees of commonly rated 
objects and user Uj's degree k u . . It is unlikely these quantities 
are exactly the same for each pair of users, therefore, sy ^ Sji 
in most cases. 



III. EFFECT OF USER SIMILARITY DIRECTION ON CF 
ALGORITHM 



If k Ui >k Uj , then Sij>Sji and vice versa. For Movielens 
and Netflix data sets, the exponential forms of user degree 
distributions indicate that most users' degree are very small 
(See Fig. [2]), which means that large-degree users would fre- 
quently be identified as small-degree users' friends. As a con- 
sequence, most users' recommendation lists would be similar. 

In order to investigate the effect of user similarity direction 
on CF algorithms, we introduce a new user similarity direc- 
tion generated by random walks from neighbor set U n to the 
target user to measure user similarities and calculate the pre- 
dicted score Vi a . The NCF algorithm could be described as 
follows: Firstly, calculate directed user similarities according 
to Eq.[3] then calculate the predicted scores for target user «,-'s 
uncollected objects by 



E 



(6) 



where f3 is a tunable parameter to investigate the effect of sim- 
ilarity strength on the algorithmic performance, and Sy is the 
similarity from user Uj to m. When (3 = 1, all the user sim- 
ilarities are given the same weight; when j3 > 1, the prefer- 
ences of users with larger similarities are strengthened; when 
P < 1, the ones with smaller similarities are strengthened. 
The numerical results indicate that changing the user similar- 
ity direction could not only accurately identify user's interests, 
but also increase the algorithmic capability of finding niche 
objects. 



According to the random-walk-based user similarity calcu- 
lation (See Fig. [TJ, the user similarity is also calculated by 
random walks in a symmetric way. In CF algorithm, the sys- 
tem should identify the target user's interesting objects with 
the help of his neighbors' historical selections or collections. 
Therefore, after we obtain the user similarity matrix, the sim- 
ilarities from neighbors to the target user are used to evaluate 
the predicted scores. According to Eq.[3] to one pair of users 



IV. MAXIMAL-SIMILARITY-BASED CF ALGORITHM 

The algorithmic performance may be affected by the user 
similarity direction, and may also be determined by the prop- 
erties of the data set. In other words, although the algorithmic 
performance of the NCF algorithm is much better than the 
one of the CF algorithm, it may only happen on specific data 
sets whose similarities from neighbors to the target user are 
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TABLE II: Performances of NCF, CF and MCF algorithms for Net- 
fiix and Movielens data sets according to each of five metrics. The 
popularity (k), diversity S, precision P and recall R corresponding 
to L = 10. 





(r) 


(k) 


S P R 




NCF 


0.0450 2506 0.8236 0.0967 0.1640 


Netflix 


CF 


0.0497 


2813 


0.7001 0.0917 0.1365 




MCF 


0.0477 


2758 


0.7378 0.0954 0.1374 




NCF 


0.0864 


237 


0.8929 0.1502 0.2037 


Movielens 


CF 


0.1037 


275 


0.8435 0.1497 0.2010 




MCF 


0.0970 


271 


0.8434 0.1459 0.1936 



TABLE III: Algorithmic performances for Movielens data when 
p = 0.9, including the average ranking score (r), diversity S 
and popularity (k) corresponding to length of recommendation list 
L — 50. CRM is a global ranking method; CF is the collabora- 
tive filtering algorithm based on random walks 1 10]; Heter-CF is a 
modified CF algorithm, in which the user similarity is defined based 
on the mass diffusion process and the second-order similarity is in- 
volved 09 O pt = -0.82) (ntkCB-CF refers to the CF algorithm on 
weighted bipartite network 11511 : Hybrid is an abbreviation of the hy- 
brid algorithm proposed in Ref. fl4ll . Each number is obtained by a 
averaging over five ten runs with independently random division of 
training set and probe set. 

Algorithms Ranking score Popularity Hamming Distance 



CRM 


0.1390 


259 


0.398 


CF 


0.1063 


229 


0.750 


Heter-CF 


0.0877 


175 


0.826 


CB-CF 


0.0914 


148 


0.763 


Hybrid 


0.0850 


167 


0.821 


NCF 


0.0864 


178 


0.801 



more effective than the ones in the opposite direction. In or- 
der to make it clear, we present a maximal-similarity-based 
CF (MCF) algorithm to investigate the influence of similarity 
magnitude, in which the predicted score from user Uj to the 
uncollected object o a , Vi a , is given by 



/ j a — -\ -Jinax 



(7) 



where s max is defined as the larger similarity between user 
and Uj 



V. SIMULATION RESULTS 



A. Data Description 



In this paper, we base our simulation results on two data 
sets. The Movielens [37] data set consists of 100,000 ratings 
from 943 users on 1,574 movies(objects) and rating scale from 
1 (i.e., worst) to 5 (i.e., best). The Netflix data set 12811 is a 
random sample of the whole records of user activities in Net- 
flix.com, which consists of 6,000 movies and 10,000 users and 
824,802 ratings. The users of Netflix also vote movies by dis- 
crete ratings from one to five. Here we apply a coarse graining 
method: a movie is considered to be collected by a user only 
if the rating is larger than 2. In this way, the Movielens data 
has 82,580 edges, and the Netflix data has 701,947 edges (See 
Table I for basic statistics). As an online movie recommenda- 
tion web site, Movielens invites users to rate movies and, in 
return, makes personalized recommendations and predictions 
for movies the user has not already rated, under-contribution 
is common. Unlike Netflix web site, MovieLens does not have 
any DVD rental service. The data set E is randomly divided 
into two parts E = E T U E p , where the training set E T 
is treated as known information, containing p percent of the 
data, and the remaining 1 — p part is set as the probe set E p , 
whose information is not allowed to be used for prediction. 



B. Metrics 

1. Average ranking score 

Accuracy is one of the most important metric to evaluate 
the recommendation algorithmic performance. An accurate 
method will put preferable objects in higher places. Here we 
use average ranking score [23] to measure the accuracy of the 
algorithm. For an arbitrary user Ui, if the object o a is not 
collected by user Ui, while the entry u.i-o a is in the probe set, 
we use the rank of o a in the recommendation list to evaluate 
accuracy. For example, if there are 8 uncollected objects for 
user and object o a is ordered at the 3rd place, we say the 
position of o a is 3/8, denoted by ri a = 0.375. Since the 
probe entries are actually collected by users, a good algorithm 
is expected to give high recommendations to them, leading 
to a small ri a . Therefore, the mean value of the positions, 
averaged over all the entries in the probe set, can be used to 
evaluate the algorithmic accuracy. 



(8) 



(r) 



n 



{ui ,o a 



l£EP ■ 



(9) 



For example, set the similarity from ui to Uj is Sji = 0.01, 
while the one from Uj to Ui is = 0.9. When recommending 
objects to u i7 the larger similarity 0.9 is used regardless of the 
similarity direction. 



where E p is the edge set existing in the probe set and q is the 
number of objects in the probe set. The smaller the average 
ranking score, the higher the algorithmic accuracy, and vice 
verse. 
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2. Precision and Recall 

Since users usually only consider the top part of the rec- 
ommendation list, a more practical metric is to consider the 
number of users' hidden links ranked in the top-L places. We 
adopt another accuracy measure called precision. For a tar- 
get user, the precision is defined as the ratio between relevant 
objects (namely the objects collected by Uj in the probe set) 
and the length L. Averaging the individual precisions over all 
users, we obtain the mean value P(L) of the algorithm on one 
data set, 

PI=-V^, (10) 

i 

where di(L) indicates the number of relevant objects existing 
in the top-L places of the recommendation list. A larger pre- 
cision corresponds to a better performance. Recall is defined 
as the ratio between the number of objects existing in the top- 
L places of the recommendation list and the total number of 
collected objects Cj in the probe set. Averaging over the indi- 
vidual recalls, we obtain the mean recall R(L), which could 
be defined as 



Netflix Movielens 




P P 

FIG. 3: (Color Online) The average ranking score (r) vs. j3 for NCF, 
CF and MCF algorithms. The optimal /3 op t of NCF for Movielens 
data set, corresponding to the minimal (r) = 0.086, is /3 op t = 3.3, 
the one for Netflix data set is /3 op t = 2.0 corresponding to (r) = 
0.0450. When /3 = 1, the algorithm degenerates to the accuracy 
of the CF algorithm based on the new user similarity direction. All 
the data points are averaged over ten independent runs with different 
data-set divisions. 



4. Popularity 



i 

The larger recall corresponds to the better performance. 



3. Diversity 

The analysis results on Facebook data set showed that, be- 
sides the common interests, users of online social networks 
also have their specific tastes and interests Il29ll . leading to di- 
verse selection behaviors. Liu et al. BOll found that users' 
tastes on Movielens and Netflix data could also be divided 
into two categories: common interests and specific interests. 
Therefore, besides accuracy, the diversity of all recommen- 
dation lists is taken into account to evaluate the algorithmic 
performance. In general, most of the users would not show 
negative altitude to popular objects, therefore, ranking popu- 
lar objects at the top part of recommendation lists would gen- 
erate higher accuracy. However, personalized recommenda- 
tion algorithms should not only present accurate prediction 
but also generate different recommendations to different users 
according to their specific tastes or habits. The diversity can 
be quantified by the average Hamming distance, 

S=l-(Q ij (L))/L, (12) 

where L is the length of the recommendation list and Qij is 
the number of overlapped objects in itj's and Uj's recommen- 
dation lists. The largest 5 = 1 indicates recommendations to 
all users are completely different, in other words, the system 
has the highest diversity. While the smallest 5 = means all 
of recommendations are exactly the same. 



An accurate and diverse recommender system is expected 
to help users find the niche or unpopular objects which are 
hard for them to identify. The metric popularity is introduced 
to quantify the capacity of an algorithm to generate unex- 
pected recommendation lists, which are defined as the average 
collected times over all recommended objects 

w = ^E(i E fc 4 (13) 

where Of is Ui's recommendation list with length L. A 
smaller average degree (k), corresponding to less popular ob- 
jects, are preferred since those lower-degree objects are hard 
to be found by users themselves. 



C. Simulation Results 

We summarize the results for NCF, CF and MCF algo- 
rithms, as well as the metrics for Movielens and Netflix data 
sets in Table II. Clearly, NCF outperforms the classical CF 
and MCF algorithms over all five metrics, including average 
ranking score (r), diversity 5, popularity (fc), precision P and 
recall R. Table III gives the comparisons among different al- 
gorithms for p = 0.9. The so-called optimal parameters are 
subject to the lowest average ranking score (r). The metrics, 
including average ranking score (r), diversity 5 and popular- 
ity (k), are obtained at the optimal parameters. From which 
one can see that the accuracy of NCF is close to the result 
of hybrid algorithm ITill and outperforms the state-of-the-art 
CF algorithms using the second-order correlation information 
ifTTll . Among all algorithms, Heter-CF algorithm gives the 
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FIG. 4: (Color online) Performances of the NCF, CF and MCF algorithms for MovieLens and Netflix data sets when recommendation lists 
equal to L = 10. (a)-(d) show average object degrees (k) vs. and diversity S vs. 0. At the optimal cases, both popularity (fc) and diversity 
S of NCF are much better than the ones of CF and MCF algorithms, (e)-(h) Precision P and recall R vs. for Netflix and Movielens data 
sets. One can find that both of P and R of NCF for Movielens are larger than the ones of CF and MCF algorithms, while the precision P for 
Netflix is close to the one of CF algorithm and the recall R is much better than the results of CF algorithm. All the data points are averaged 
over ten independent runs with different divisions of training-probe sets. 



highest diversity, while CB-CF algorithm generates the lowest 
popularity. Comparing with these two outstanding algorithms, 
NCF can reach or closely approach the best diversity without 
taking into account high-order correlation, and provide more 
accurate recommendation results. 

Figure 3 reports the algorithmic accuracy as a function of 
j3. In our algorithm, the curve (the red circle) has a clear min- 
imum around f3 opt =3.3 for Movielens and /3 opt = 2.0 for 
Netflix. Comparing with CF algorithms whose user similar- 
ities are defined from large-degree to small-degree users, the 
average ranking score (r) of NCF is reduced from 0.497 to 
0.045 for Netflix and from 0.1037 to 0.0864 for Movielens 
data set, reduced by 9.9% and 16.68% respectively at the op- 
timal values. Comparing with the MCF algorithm, the per- 
formance of NCF is also better. Subject to the accuracy, the 
reason why NCF outperforms CF and MCF indeed lies in the 
direction effect but not the data effect, and the results also in- 
dicate that giving more recommendation power to the small- 
degree users could enhance the accuracy and diversity simul- 
taneously. 

The Hamming distance S is introduced to measure the al- 
gorithmic performance to present personalized recommenda- 
tion lists. The average object degree (k) is used to evaluate 
the ability that an algorithm gives a novel recommendation. 
Figure 4 (a)-(d) show (k) and S as a function of (3 when rec- 
ommendation list length L = 10 respectively. For the Movie- 
lens data set, at the optimal point /3 opt = 3.3, the popular- 
ity (k) = 237, which is reduced by 13.8%, and the diversity 
S = 0.8929 is improved by 5.9% comparing with the ones 
of CF at its optimal value. When the list length L = 10, the 
popularity (k) and diversity S of NCF are reduced by 10.9% 
and 17.64% for Netflix data set. From which one can find 
that the NCF algorithm using the new directed random walks 
has the capability to find the niche objects, leading to diverse 
recommendations. 



In general, NCF outperforms CF as well as MCF in terms 
of the accuracy (r), diversity S and popularity (k). However, 
in reality, users only care about the top part of the recommen- 
dation list. From Fig. 4 (e)-(h), one can find that, compar- 
ing with the results of CF and MCF algorithms, the precision 
P and recall R of NCF are also very good. When L = 10 
with the optimal parameter corresponding to the lowest rank- 
ing score, the precision P is approximately improved 3.0% 
and 5.5%, and the recall R is roughly enhanced by 5.2% and 
20. 15% for Movielens and Netflix data sets respectively. 

Since the similarities generated by the random walk pro- 
cess from small-degree to large-degree users are larger than 
the ones from the opposite direction, the simulation results 
indicate that enhancing the small-degree users' recommenda- 
tion powers increases the prediction accuracy, and helps users 
find niche objects, leading to more diverse recommendations. 
Figure investigates the correlation between the target user 
degree k u and its neighbors' average degree (fc™) as well as 
deviation D(k u ), where the target user's neighbors U n are de- 
fined as the users who have at least one common rated object 
with the target user, which could be obtained from the adja- 
cent matrix A. Denoting the user correlation matrix as C nscY , 
we have C uscr = AA T . The element C" sor means the num- 
ber of common rated objects between user Ui and Uj. Given 
a matrix T = {Uj} G R n ' n , with Uj = 1 if C™ 50r > 0, and 
tij = if Cj" scr = 0. The number of correlated neighbors 
k^ for a target user u could be given as k c u — X)J=i *«3> men 
average degree (fc™) of correlated neighbors U n is defined by 
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FIG. 5: (a)-(b) The average degree of the target user's neighbors 
{k™} and the corresponding deviation D(k u ) vs. target user degree 
kn for Netflix data set. (c)-(d) show the results for Movielens. To 
small-degree users, both their neighbors' average degree (fc") and 
the deviation D are very large. As k u increases, (fc™) and D(k u ) 
would decease correspondingly. 




The deviation D(k u ) could be given as 



D(k u ) 



\ 



1 n 



t k- 



(15) 



Figure [5] shows that when k u is very small, both neighbors' 
average degree (k") and deviation D(k u ) are very large, 
which means that for Movielens and Netflix data sets, the 
small-degree users would like to commonly rate objects with 
small-degree users and large-degree users. As k u increases, 
both (fc™) and D(k u ) would decrease correspondingly, which 
means that the large-degree users only commonly rate objects 
with small-degree users. According to our previous analysis, 
if the user similarities from neighbors to the target user are 
enhanced, the effects of the small-degree users would be em- 
phasized to match both large-degree and small-degree users' 
common and specific interests, which is the reason why di- 
rected random-walk-based user similarity is effective. 



FIG. 6: Improvements of the average ranking score (r), average ob- 
jects degree (k) and diversity S to different sparsity of the training 
set for Movielens data set. All the data points are averaged over ten 
independent runs with different data set divisions. 



optimal parameter subject to the lower ranking score. The im- 
provement function /((r)) of the present algorithm is defined 

as 



/((r» 



(r>CF - (TVncf 



{r)cF 



(16) 



For the popularity (k) and Hamming distance S, the improve- 
ment functions are defined as 



/(W)= Wc wct >NCF - 
f(S) = Sn % f Scv - 



(17) 



VI. EFFECTS OF DATA SPARSITY 

We investigate the effects of the data sparsity on the per- 
formance. Since we focus on the similarity direction effect 
on the CF algorithm, we choose the classical CF algorithm 
for comparison. In the simulation work, we select pE edges 
as training set, and set the rest (1 — p)E edges as probe set. 
Lower p means less information is used to generate the recom- 
mendations. The numerical results for Movielens are shown 
in Fig. 6. Each point of the histogram is obtained with the 



Figure 6 shows that the improvement of average ranking score 
(r) decreases as the size of the training set decreases, which 
may be come from the fact that the number of neighbors 
would decrease and less information could be used to predict 
the target user's interests. We also found NCF performs much 
better than CF for denser datasets. The improvement of di- 
versity f(S) decreases with the increasing size of training set, 
and f((k)) increases with more denser training set, which in- 
dicate that, generally speaking, users prefer to select popular 
objects as they give more ratings. 
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VII. CONCLUSION AND DISCUSSIONS 

In this paper, by tuning the random walk direction from 
neighbors to the target user to calculate the directed user sim- 
ilarity, we investigate the physics of directed random walks 
and their influence on the information filtering of user-object 
bipartite networks. Simulation results indicate that the new 
CF algorithm using the new directed random walks outper- 
forms state-of-the-art CF methods in terms of the accuracy, 
as well as the Heter-CF algorithm in terms of diversity si- 
multaneously. Meanwhile NCF has much better capability 
to present more accurate and diverse recommendations than 
CF algorithms whose user similarities are calculated from the 
target user to neighbors. The accuracies of NCF algorithms 
are close to the results of the hybrid algorithm (3, and the 
diversities are also increased dramatically. 

CF algorithms are one of the most successful information 
filtering algorithm and has been extensively used on lots of 
web sites, such as Netflix and Amazon. HC model also has 
been successfully used for information filtering. If we sup- 
pose the users rated one specific objects are the heat source, 
CF algorithms are equivalent to the user-based HC model. Al- 
though random walks have been used to improve the user or 
object similarity measurement fiol 0], the reason why di- 
rected similarity could enhance the information filtering per- 
formance is missing. Since the idea of the CF algorithm is 
combing neighbors' opinions of the target user to predict his 
interests or habits, we always suppose that the CF algorithm 
would like to converge users' interests and present popular 
objects. But the analysis in this paper indicates that if small- 
degree users' recommendation powers are increased CF al- 
gorithm also could solve the accuracy-diversity dilemma. In 
most of the online social systems, the number of small-degree 
users is always much larger than large-degree ones. Ac- 
cording to the random-walk-based user similarity definition, 
we know that similarities from small-degree users are always 
larger than the reversed ones. Therefore, in the CF algorithm, 
the opinions of the large-degree users would be recommended 
to most of small-degree users, leading to lower diversity. By 



tuning the similarities from neighbors to the target user, we 
could emphasize the recommendation powers of small-degree 
users and enhance the accuracy and diversity simultaneously, 
which indicates that the similarity direction is an important 
factor for information filtering. Although the idea of this pa- 
per is simple, the remarkable simulation results indicate that, 
to generate accurate and diverse recommendations, we only 
need to change the direction without changing the framework 
of the existing CF systems. 

The directed random walk process presented in this pa- 
per indeed has been defined as a local index of similarity 
in link prediction Oil 13211 . community detection [33] and so 
on. Meanwhile, a number of similarities, based on the global 
structural information, have been used for information filter- 
ing, such as the transferring similarity [12] and the PageR- 
ank index 14J], communicability 113411 and so on. Although the 
calculation of such measures is of high complexity, it's very 
important to the effects of directed random walks on these 
measures. The hybrid algorithm lfl4ll is also a kind of item- 
based CF algorithm where the item similarity is measured by 
combining the random walk and heat conduction processes to- 
gether. Lii et al. [35] proposed an improved hybrid algorithm 
by embedding the preferential diffusion process into hybrid 
algorithm. Qiu et al. 13611 proposed an improved method by 
introducing an item-oriented function to solve the cold-start 
problem. In this paper, we find that the direction of random 
walks is very important for information filtering, which may 
be helpful for deeply understanding of the applicability of di- 
rected similarity. 
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